Streptavidin-binding peptide

ABSTRACT

The invention relates to novel streptavidin binding peptides.

FIELD OF THE INVENTION

The invention relates to a streptavidin-binding peptide and to methods for the production of a streptavidin-binding peptide in a cell-based or cell-free protein biosynthesis system.

Further, the invention relates to the use of a streptavidin-binding peptide for purifying a defined protein produced in a protein biosynthesis system, and to the use of a streptavidin-binding peptide for marking a defined protein.

PRIOR ART

Methods for efficiently expressing defined proteins in various pro- and eukaryotic organisms are known in the art and do not require further explanations. As defined proteins are understood here peptides and/or proteins, which are expressed naturally or after transformation or use of defined RNA in the organism used for the expression or cell-free expression system and are enriched in purification steps.

Methods for the cell-free expression of defined proteins are for instance known in the art from documents EP 0312 617 B1, EP 0401 369 B1 and EP 0593 757 B1. According thereto, the components necessary for a transcription and/or translation are incubated, in addition to a nucleic acid strand coding for a defined protein, in a reaction vessel and after the expression the polypeptides/proteins are isolated from the reaction solution. The components necessary for the transcription as well as for the translation can be obtained from the supernatants of pro- or eukaryotic cell lysates after a centrifugation.

An essential problem for the expression of defined proteins in pro- and eukaryotic organisms and for the cell-free expression is the purification and/or the detection of the expressed defined proteins. This is particularly problematic with defined proteins, for which there are no antisera or monoclonal antibodies. For simplifying the purification and the detection of such defined proteins, they are expressed as so-called fusion proteins. Hereby, an amino acid sequence is added N- and/or C-terminally to the defined protein or inserted between two protein domains (internally), the fusion partner. This happens on the nucleic acid level, so that the defined protein as well as the fusion partner added for the detection and/or purification are expressed together during a transcription/translation process as a chimeric (fusion) protein, consisting of the defined protein and the fusion partner. The mentioned fusion partner may be a single amino acid, but also a peptide or protein. For these added fusion partners, there are available binding partners immobilized for purification, and the fusion proteins can be isolated with these binding partners. In addition to the possibility of the purification of the proteins, they may also be detected with binding partners being specific for the fusion partner. These binding partners may be antibodies specific for the fusion partner or also other proteins, peptides or chemical compounds specifically binding to the fusion partner.

Examples for such fusion partners are the Myc-tag (Munro & Pelham, Cell 46:291-300, 1986; Ward et al., Nature 341:544-546, 1998), the FLAG peptide (Hopp et al., Bio/Technology 6:1204-1210, 1988), the KT3 epitope peptide (Martinet et al., Cell 63:843-849, 1990; Martin et al., Science 255:192-194, 1992) and the alpha-tubulin epitope peptide (Skinner et al., J. Biol. Chem. 266:14163-14166, 1991), all of which were successfully used for the detection and in part also for the purification of proteins. For part of the fusion partners normally having a length of 3 to 12 amino acids, it could be shown that they do not affect the biological function of the defined proteins. The biological function of the defined protein is however affected with increasing length of the fusion partner, since the additionally expressed amino acids can affect e.g. the development of the secondary, tertiary and/or quaternary structure. Longer fusion partners are thus suitable for the detection of the proteins, but less for the purification of the proteins. On the other hand, longer fusion partners frequently have a higher affinity with their specific binding partners.

A substantial drawback of the above fusion partners with respect to the purification is that the binding to the binding partner is based on an antigen/antibody binding, and the production and purification of the antibodies used as binding partners is time-consuming and expensive. Another drawback is that the antigen/antibody binding requires a very strong binding between the binding partner, e.g. antibody immobilized at a matrix, and the fusion partner. This has as a consequence that for the elution of the fusion protein bound via the fusion partner, sometimes extremely unphysiological conditions referred to the defined protein have to be generated. Unphysiological conditions are here e.g. very high or extremely low salt concentrations, use of chaotrophic salts and pH values differing by far from the natural pH value of the defined protein. This may affect, under certain circumstances, the structure and/or the functionality of the defined protein or irreversibly destroy it. Correspondingly, the purification of the defined proteins should take place under as gentle and physiologic conditions as possible, in order to maintain the functionality of the proteins. For three of the above fusion partners (Hopp et al., 1988; Martin et al., 1990; Skinner et al., 1991), an elution could even be achieved under gentle conditions by means of competitive peptides, however there is still the problem of the time-consuming and expensive antibodies to be purified (and serving as binding partners) and the binding thereof to the matrix.

Further fusion partners consisting of 8 to 9 amino acids are known in the art from the documents U.S. Pat. No. 5,506,121 and Schmidt & Skerra (Protein Engineering, vol. 6, no. 1, 109-122, 1993). The fusion partners disclosed therein are capable to bind to the streptavidin or to the “core” streptavidin, a proteolytically cleaved product of the streptavidin (Bayer et al., Biochem. J. 259:369-376, 1989).

All known fusion partner binding to the streptavidin contain the amino acid sequence HPQ, the so-called HPQ binding motif coming into interaction with the biotin-binding pocket of the streptavidin. To the already known peptides belong the so-called Strep-tag I: AWRHPQFGG with a dissociation constant Kd of 10-37 μM, the so-called Strep-tag II: WSHPQFEK with a dissociation constant Kd of 18-72 μM, and the so-called SBP-tag: MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP with a dissociation constant Kd of 2.5 nM. In contrast to the two short Strep-tags I and II, the longer SBP-tag has a substantially stronger binding to the streptavidin. However, as mentioned above, the function of the defined protein is in particular affected with increasing length of the fusion partner. Further, particularly long fusion partners are disturbing the crystallization of the defined proteins.

TECHNICAL OBJECT OF THE INVENTION

It is the technical object of the invention to provide an as short as possible streptavidin-binding peptide having a strong binding to the streptavidin.

Basics of the Invention and Preferred Embodiments

For achieving the technical object, the invention relates to a streptavidin-binding peptide comprising or consisting of an amino acid sequence according to Seq.-ID 1-12. By the streptavidin-binding peptide according to the invention comprising an amino acid sequence according to Seq.-ID 1-12 it is achieved, compared to prior art, a substantially stronger binding between the streptavidin-binding peptide and the streptavidin, or the streptavidin-binding peptide can substantially be shorter with the same binding strength.

Further, the invention relates to a nucleic acid coding for a streptavidin-binding peptide according to Seq.-ID 1-12 and a plasmid comprising a nucleic acid according to the invention. It is understood that the nucleic acid coding for a streptavidin-binding peptide according to the invention and/or the plasmid can be adjusted to the respective expression system/protein biosynthesis system. The plasmid may be adapted as an expression vector, in particular for bacteria, comprising a region with at least one interface for a restriction enzyme, in which the nucleic acid sequence coding for a defined protein may be inserted, and thus the defined protein is expressed together with the streptavidin-binding peptide according to Seq.-ID 1-12. It is understood that the region coding for the defined protein and for the streptavidin-binding peptide according to Seq.-ID 1-12 are under the control of a suitable promoter and/or operator and terminator. The region with at least one interface for at least one restriction enzyme may lie in the 5′ as well as 3′ direction of the nucleic acid region coding for the streptavidin-binding peptide according to Seq.-ID 1-12. The nucleic acid region coding for the streptavidin-binding peptide according to Seq.-ID 1-12 needs not to follow immediately to the nucleic acid region coding for the defined protein, rather there may be nucleic acids coding for 1 to 20 amino acids, in particular for 5 to 10 amino acids, between the two regions.

The invention also relates to a method for the production of a streptavidin-binding peptide according to Seq.-ID 1-12, wherein a nucleic acid is expressed or overexpressed in a cell-based or cell-free protein biosynthesis system. The thus produced peptide can easily be isolated via the binding to the streptavidin. The obtained translation product, i.e. a streptavidin-binding peptide according to Seq.-ID 1-12, is contacted with immobilized streptavidin and is bound thereto. After separation of the solution with substances not bound to streptavidin, the translation product is eluted. As an elution agent, buffers can be used, which contain biotin or related substances and/or derivatives, such as iminobiotin, desthiobiotin and/or diaminobiotin. The obtained streptavidin-binding peptide according to Seq.-ID 1-12 carries in the case of the fusion with the defined protein this protein. It may also be used independently from a defined protein for the antibody production. The obtained antibodies can be used e.g. for the detection or purification of the streptavidin-binding peptides according to Seq.-ID 1-12 or in the case that the streptavidin-binding peptide according to Seq.-ID 1-12 is employed as a fusion partner, of the defined protein bound to this fusion partner.

It is understood that the production of such a streptavidin-binding peptide containing an amino acid sequence according to Seq.-ID 1-12 is also possible by means of chemical solid phase synthesis, for instance with an automatic synthesis system made by Abimed (Langenfeld, Germany). This method is based on the standard protocols of the Fmoc chemistry (Fmoc=9-fluorenylmethyloxycarbonyl).

The invention also relates to the use of a streptavidin-binding peptide according to Seq.-ID 1-12 for the purification of a defined protein produced in a protein biosynthesis system, wherein a nucleic acid coding for the protein and, connected therewith, for the streptavidin-binding peptide is subjected to a transcription and/or translation, wherein a solution comprising the thus obtained translation product is contacted with immobilized streptavidin and is bound thereto, and wherein after separation of the solution with substances not bound to streptavidin the translation product is eluted. The elution may take place under gentle conditions for the defined protein. As an elution agent, buffers can be used, which contain biotin or related substances and/or derivatives, such as iminobiotin, desthiobiotin and/or diaminobiotin. The produced defined protein may comprise N and/or C terminally the streptavidin-binding peptide according to Seq.-ID 1-12 serving as a fusion partner.

If a streptavidin-Sepharose column is used, the defined protein to be investigated can be immobilized at the matrix by means of the streptavidin-binding peptide according to Seq.-ID 1-12 serving as a fusion partner and isolated from a mixture of molecules, e.g. a cell lysate.

The invention also relates to the use of a streptavidin-binding peptide according to Seq.-ID 1-12 for labeling a defined protein, wherein a nucleic acid coding for the protein and, connected therewith, for the streptavidin-binding peptide is subjected to a transcription and/or translation, wherein the thus obtained translation product is contacted with a streptavidin conjugate comprising a reporter molecule and bound thereto. The labeled defined protein may comprise N and/or C terminally the streptavidin-binding peptide according to Seq.-ID 1-12 serving for labeling. Reporter molecules may for instance be radioactive and/or radioactively labeled substances. It is understood that the streptavidin as such may also be radioactive and/or radioactively labeled; in this case a reporter molecule is not needed. Reporter molecules may also be fluorescent substances or enzymes, such as alkaline phosphatase or peroxidase. By such streptavidin compounds coupled to a reporter molecule, for instance proteins, which have the streptavidin-binding peptide according to Seq.-ID 1-12 as a fusion partner, can be detected and/or quantified on a Western blot or in the ELISA (enzyme linked immunosorbent assay). If the defined proteins are binding proteins, other proteins binding thereto can also be detected in this way. Binding proteins are in this context proteins, which themselves bind other proteins and/or themselves bind to other proteins, such as in an antigen/antibody binding or in a binding of a protein to a receptor.

Preferred is a streptavidin-binding peptide comprising less than 30 amino acids, preferably less than 20 amino acids, most preferably less than 10 amino acids.

In the following, the invention is explained in more detail, based on figures and examples representing embodiments only.

FIG. 1: purification of FABP with streptavidin-binding peptide according to motif 3 as a fusion partner after cell-free protein biosynthesis via a streptavidin affinity column. A comparable amount of each fraction was analyzed by means of SDS polyacrylamide gel electrophoresis. (A) Coomassie stained and (B) autoradiogramme. The samples in the numbered tracks are (1) molecular weight markers; (2) reaction mixture; (3) passage of the sample application; (4-6) washing fractions; (7-9) elution fractions; (10) radioactive molecular weight marker.

FIG. 2: purification of FABP with streptavidin-binding peptide according to motif 3 as a fusion partner after cell-free protein biosynthesis via a StrepTactin affinity column. A comparable amount of each fraction was analyzed by means of SDS polyacrylamide gel electrophoresis. (A) Coomassie stained and (B) autoradiogramme. The samples in the numbered tracks are (1) molecular weight markers; (2) passage of the sample application; (3-5) washing fractions; (6-10) elution fractions; (11) radioactive molecular weight marker.

EXAMPLE 1 Binding Peptides According to the Invention and Comparison Peptides with Dissociation Constants

In a for this technology usual way, the FAB protein (fatty acid binding protein) from bovine hearts was expressed in vitro in a cell-free protein biosynthesis by means of a coupled transcription/translation system. The expressed FAB protein had at the N terminus an additional streptavidin-binding peptide composed of 15 amino acids with different amino acid sequences as fusion partners.

Two binding peptides contain in their amino acid sequence the HPQ motif (motif 1 and 2) described in the prior art, and two of them contain the streptavidin-binding peptide according to Seq.-ID 1-6 according to the invention (motif 3 and 4). The sequences of the peptides are shown in Table 1. In addition, the expressed proteins were provided at the C terminus with a “His-tag” consisting of 6 histidines. The proteins were purified by means of affinity chromatography via Ni²⁺ IDA-agarose in a usual way. It can be seen that with equal length, binding peptides of a sequence according to the invention have a substantially lower dissociation constant than a binding peptide with HPQ motif. The Kd value of a binding peptide according to the invention is in the order of the SBP-tag in spite of the 2.5 fold length of the SBP-tag. TABLE 1 Dissociation Sequence constant [kd] motif 1 D L Y D I D R N W V G H P Q G  8 μM motif 2 D N Y D A D L A W D T H P Q D  70 μM motif 3 D V E A W L D E R V P L V E T  84 nM motif 4 D V E A W I A D P A V H F T T 200 nM

EXAMPLE 2 Kind of Measurement of the Dissociation Constant of Table 1

The measurements of the dissociation constant were performed with a BiacoreX system and the sensor chip NTA of the company Biacore. The proteins generated during the expression described in Example 1 were investigated, i.e. FAB proteins, which had at the N terminus a peptide consisting of 15 amino acids as fusion partners and at the C terminus 6 histidines, which were needed for the immobilization on the sensor chip. The binding affinity of the proteins generated during the expression to streptavidin described in Example 1 was measured in the Biacore device according to manufacturer's instructions. The protein to be investigated is on the sensor chip, and a streptavidin solution having a defined concentration is sprayed in. The interaction (binding) between the two molecules is measured by the device and quantified as so-called resonance units (RU). For the measurement, the buffers specified by the manufacturer were used.

The results of the measurements are shown in Table 2. The obtained measurements were evaluated with the provided software and led to the dissociation constants shown in Table 1. TABLE 2 Motif 1 2 3 4 Streptavidin conc. RU RU RU RU 15 nM  98 30 nM 242  68 60 nM 461 171 125 nM 613 280 250 nM 704 384 500 nM  62 786 478 1 μM 123 — 560 2 μM 233 946 644 3 μM —  64 — 4 μM 407 — 983 6 μM —  98 8 μM 621 — 15 μM — 201 16 μM 779 — 30 μM — 337 32 μM 955 — 60 μM 536

EXAMPLE 3 Purification of a Fusion Protein

In a usual way the FAB protein having at the N terminus an additional peptide consisting of 15 amino acids with the amino acid sequence DVEAWLDERVPLVET (motif 3) as fusion partners, is expressed in vitro in a cell-free protein biosynthesis by means of a coupled transcription/translation system. For purification of the overexpressed defined protein, the streptavidin was coupled to a solid phase. As a solid phase was used a Sepharose. The expressed FAB protein was then purified in a usual way via a column comprising streptavidin-Sepharose or StrepTactin-Sepharose. For the purification the following buffers were used: washing buffer (100 mM Tris-HCl (pH 8.0), 150 mM NaCl, 1 mM EDTA).

For the elution of streptavidin, the washing buffer contained 2 mM biotin, and for the elution of StrepTactin, the washing buffer contained 2.5 mM desthiobiotin.

The percentage distribution of the used defined protein on the various fractions of the affinity chromatography can be taken from Table 3. In Table 3, D represents the application of sample/passage, W washing fraction, and E elution fraction. TABLE 3 Fraction D W1 W2 W3 E1 E2 E3 E4 E5 Tot. Streptavidin 3.5% 6.8% 1.0% 0.3% 0.4% 76.9% 0.7% — — 89.6% StrepTactin 3.0% 7.4% 1.8% 0.9% 0.9% 45.9% 23.2% 1.0% 0.3% 84.4%

When using streptavidin-Sepharose, 90% of the applied protein could be recovered from the column, 78% for the eluate. The quality of the purification is shown in FIG. 1.

When using StrepTactin-Sepharose, 84% of the applied protein could be recovered from the column, 71% for the eluate. The quality of the purification is shown in FIG. 2.

EXAMPLE 4 Optimization of Individual Binding Peptides

The binding peptide (FAx 3) with the sequence DVEAWLDERVPLVET was subjected to a substitution analysis, and it was intended thereby to identify a possible minimum motif (=shortened peptide) and examine the effect of individual amino acid exchanges on the binding specificity. In a substitution analysis, peptides are synthesized by means of spot methods (Frank, R. 1992) on a cellulose membrane serving as a solid phase. Every position of the peptide comprising 15 amino acids was systematically replaced by the other 19 L amino acids.

The substitution analysis was incubated with a peroxidase-labeled streptavidin and developed with a luminescence substrate. Thereby it was possible to find out, how far the peptide can be shortened and which amino acids are “conserved” and thus not or only very poorly exchangeable. The following picture resulted: a peptide, which has sufficient streptavidin binding, is the 6-mer peptide DVEAWL. A clearly better streptavidin binding has the 9-mer peptide DVEAWLDER. The substitution analysis also indicated positions, where the exchange of the original amino acid by another one could result in an improvement of the streptavidin binding.

The information supplied by the substitution analysis was verified by that the binding constants of different peptides were determined by surface plasmon resonance spectroscopy using a BIAcore device. The measurements were made with the NTA sensor chip of the company BIAcore and by means of the fatty acid-binding protein from bovine hearts (FABP). The FABP served for the immobilization and presentation of the peptide. Further, it was intended to thus simulate the conditions occurring in an affinity chromatography, since the peptide is to be used in the future as an affinity peptide for the purification of proteins. In contrast to the first measurements, where the peptides to be verified replaced the four N terminal amino acids of the FABP, the arrangement of the peptides was this time also N terminal, but they were not part of the FABP. The His-tag necessary for the immobilization was also modified. It was located same as for the first measurements at the C terminus of the FABP, was however extended from six to eight histidines and further spaced by a linker consisting of two glycines from the FABP. Thereby, the immobilization of the ligands could distinctly be improved, and thus the drift of the base line resulting from the washing-off of the ligand could be minimized. For every measurement, an identical amount of the respective FABPs was immobilized, what corresponded to a rise of the base line by 1,100 resonance units (RU). As a reference was used the FABP with the His-tag, but without N terminal binding peptide.

The binding constants obtained with the optimized measuring method were the following: Kd Peptide (FAx 3): D V E A W L D E R V P L V E T  3.6 ± 0.6 nM (AspValGluAlaTrpLeuAspGluArgValProLeuValGluThr) Shortened peptide (9-mer): DVEAWLDER 240 ± 40 nM Influence of other amino acids on the shortened 9-mer peptide: 3 Asp: DVDAWLDER 940 ± 140 nM 3 Gly: DVGAWLDER 570 ± 90 nM 6 Phe: DVEAWFDER not measurable 6 Arg: DVEAWRDER not measurable 7 Gly: DVEAWLGER  17 ± 4 nM 8 Ala: DVEAWLDAR 160 ± 30 nM A positive influence had in particular glycine at position 7 and alanine at position 8. Combination of these two amino acids:

-   Shortened peptide: DVEAWLGAR 17±4 nM -   Original peptide with Gly7: DVEAWLGERVPLVET4.2±0.7 nM -   Original peptide: DVEAWLGARVPLVET 4.1±0.6 nM

The original 15-mer peptide cannot further be improved. The shortened 9-mer peptide however can distinctly be improved by a glycine in lieu of the asparagic acid at position 7. The alanine in lieu of the glutamine acid at position 8 also has a slightly better binding affinity with regard to streptavidin. By the combination of these two amino acids the binding affinity cannot further be improved, however it will not become worse either.

The invention thus also comprises a 9-mer peptide with its not or only slightly conserved positions:

DVXAWLXXR (X=an arbitrary amino acid)

Further of course the shortened peptide with glycine at position 7:

-   -   DVEAWLGER

and the shortened peptide with glycine at position 7 and alanine at position 8:

-   -   DVEAWLGAR

EXAMPLE 5 Variants of Sequences According to the Invention

In the following, particularly suited sequences according to the invention are given. Seq.-ID 1: DVEAW Seq.-ID 2: DVEA Seq.-ID 3: VEAW Seq.-ID 4: DVE Seq.-ID 5: VEA Seq.-ID 6: EAW Seq.-ID 7: DVXAW Seq.-ID 8: DVXAWL Seq.-ID 9: DVXAWLX Seq.-ID 10: DVXAWLXX Seq.-ID 11: DVXAWLXXR Seq.-ID 12: DVXAWLXXRVPLVET Seq.-ID 13: VPLVET

X at position 3 may be arbitrary, however in particular E, D or G. X at position 7 may be arbitrary, however in particular G or D. X at position 8 may be arbitrary, however in particular A or E. Sequence 13 is an optional carboxy-terminal extension of the sequence 11, and from sequence 13 1 to 6 amino acids may be connected, beginning amino-terminally. 

1. A streptavidin-binding peptide comprising or consisting of an amino acid sequence according to SEQ ID NOS: 1-12.
 2. A nucleic acid coding for a streptavidin-binding peptide according to claim
 1. 3. A plasmid comprising a nucleic acid according to claim
 2. 4. A method for the production of a streptavidin-binding peptide according to claim 1, wherein a nucleic acid according to claim 2 is expressed or overexpressed in a cell-based or cell-free protein biosynthesis system.
 5. A method for the purification of a defined protein produced in a protein biosynthesis system, wherein a nucleic acid coding for the protein and, connected therewith, for the streptavidin-binding peptide of claim 1, controlled by a regulatory sequence, is subjected to a transcription or translation, wherein a solution comprising the thus obtained translation product is contacted with immobilized streptavidin, and wherein after separation of the solution with substances not bound to the streptavidin the translation product is released from the streptavidin and eluted.
 6. A method for labeling a defined protein, wherein a nuclic acid coding for the protein and, connected therewith, for the streptavidin-binding peptide according to claim 1 is subjected to a transcription or translation, wherein the thus obtained translation product is contacted with a streptavidin conjugate comprising a reporter molecule and is bound thereto. 